SHIFTX2: significantly improved protein chemical shift prediction
نویسندگان
چکیده
A new computer program, called SHIFTX2, is described which is capable of rapidly and accurately calculating diamagnetic (1)H, (13)C and (15)N chemical shifts from protein coordinate data. Compared to its predecessor (SHIFTX) and to other existing protein chemical shift prediction programs, SHIFTX2 is substantially more accurate (up to 26% better by correlation coefficient with an RMS error that is up to 3.3× smaller) than the next best performing program. It also provides significantly more coverage (up to 10% more), is significantly faster (up to 8.5×) and capable of calculating a wider variety of backbone and side chain chemical shifts (up to 6×) than many other shift predictors. In particular, SHIFTX2 is able to attain correlation coefficients between experimentally observed and predicted backbone chemical shifts of 0.9800 ((15)N), 0.9959 ((13)Cα), 0.9992 ((13)Cβ), 0.9676 ((13)C'), 0.9714 ((1)HN), 0.9744 ((1)Hα) and RMS errors of 1.1169, 0.4412, 0.5163, 0.5330, 0.1711, and 0.1231 ppm, respectively. The correlation between SHIFTX2's predicted and observed side chain chemical shifts is 0.9787 ((13)C) and 0.9482 ((1)H) with RMS errors of 0.9754 and 0.1723 ppm, respectively. SHIFTX2 is able to achieve such a high level of accuracy by using a large, high quality database of training proteins (>190), by utilizing advanced machine learning techniques, by incorporating many more features (χ(2) and χ(3) angles, solvent accessibility, H-bond geometry, pH, temperature), and by combining sequence-based with structure-based chemical shift prediction techniques. With this substantial improvement in accuracy we believe that SHIFTX2 will open the door to many long-anticipated applications of chemical shift prediction to protein structure determination, refinement and validation. SHIFTX2 is available both as a standalone program and as a web server ( http://www.shiftx2.ca ).
منابع مشابه
Structure-Based Chemical Shift Prediction Using Random Forests Non-Linear Regression
Protein nuclear magnetic resonance (NMR) chemical shifts are among the most accurately measurable spectroscopic parameters and are closely correlated to protein structure because of their dependence on the local electronic environment. The precise nature of this correlation remains largely unknown. Accurate prediction of chemical shifts from existing structures’ atomic co-ordinates will permit ...
متن کاملProtein structure refinement using a quantum mechanics-based chemical shielding predictor.
The accurate prediction of protein chemical shifts using a quantum mechanics (QM)-based method has been the subject of intense research for more than 20 years but so far empirical methods for chemical shift prediction have proven more accurate. In this paper we show that a QM-based predictor of a protein backbone and CB chemical shifts (ProCS15, PeerJ, 2016, 3, e1344) is of comparable accuracy ...
متن کاملUsing neural networks for (13)c NMR chemical shift prediction-comparison with traditional methods.
Interpretation of (13)C chemical shifts is essential for structure elucidation of organic molecules by NMR. In this article, we present an improved neural network approach and compare its performance to that of commonly used approaches. Specifically, our recently proposed neural network (J. Chem. Inf. Comput. Sci. 2000, 40, 1169-1176) is improved by introducing an extended hybrid numerical desc...
متن کاملMicrosoft Word - predicting NMR parameters
The spectral parameters of the nuclear magnetic resonance (NMR) spectra are dependent on the chemical environment around the nuclei, making NMR spectroscopy a powerful method for studying molecular structure and dynamics at the atomic level. Conversely, the spectral parameters can be calculated if one knows the molecular structure. The spectral parameter prediction plays a key role in many appl...
متن کاملNMRFAM-SPARKY: enhanced software for biomolecular NMR spectroscopy
UNLABELLED SPARKY (Goddard and Kneller, SPARKY 3) remains the most popular software program for NMR data analysis, despite the fact that development of the package by its originators ceased in 2001. We have taken over the development of this package and describe NMRFAM-SPARKY, which implements new functions reflecting advances in the biomolecular NMR field. NMRFAM-SPARKY has been repackaged wit...
متن کامل